Online Data Fusion
Abstract
The Web contains a significant volume of structured data in various domains, but much of it is dirty or erroneous, and errors can propagate through copying. While data integration techniques allow querying structured data on the Web, they take the union of the answers retrieved from different sources and can thus return conflicting information. Data fusion techniques, on the other hand, aim to find the true values, but are designed for offline data aggregation and can take a long time. This paper proposes SOLARIS, the first online data fusion system. It starts by returning answers from the first probed source, and refreshes the answers as it probes more sources and applies fusion techniques to the retrieved data. For each returned answer, it shows the likelihood that the answer is correct, and stops retrieving data for it once it is confident enough that data from the unprocessed sources are unlikely to change the answer. We address key problems in building such a system and show empirically that the system can start returning correct answers quickly and terminate fast without sacrificing the quality of the answers.
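The probe, refresh, and stop loop described in the abstract can be illustrated with a minimal Python sketch. This is not the SOLARIS algorithm: the log-odds vote weights, the single "unobserved alternative" used to normalize probabilities, the strict stopping rule (stop only when the remaining sources cannot overturn the leader), and the name `online_fuse` are all assumptions made for illustration, and copying between sources is ignored.

```python
import math
from collections import defaultdict


def vote_weight(accuracy):
    # Log-odds weight of a source (assumption: 0.5 < accuracy < 1).
    return math.log(accuracy / (1.0 - accuracy))


def online_fuse(sources):
    """Yield (answer, probability, finished) after each probed source.

    `sources` is a list of (accuracy, value) pairs in probing order.
    Hypothetical helper for illustration; copying is not modeled.
    """
    weights = [vote_weight(acc) for acc, _ in sources]
    remaining = sum(weights)  # total vote weight of unprobed sources
    scores = defaultdict(float)

    for (_, value), weight in zip(sources, weights):
        scores[value] += weight
        remaining -= weight

        ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
        best_value, best_score = ranked[0]
        # A value nobody has reported yet has score 0.
        runner_up = ranked[1][1] if len(ranked) > 1 else 0.0

        # Normalize over observed values plus one unobserved alternative
        # (the 1.0 term is exp(0)); an assumption of this sketch.
        total = 1.0 + sum(math.exp(s) for s in scores.values())
        probability = math.exp(best_score) / total

        # Stop once the unprobed sources cannot close the gap, even if
        # they all vote for the same competing value.
        finished = best_score - runner_up > remaining
        yield best_value, probability, finished
        if finished:
            return
```

Each yielded tuple corresponds to one refresh of the answer shown to the user. The stopping rule here is stricter than the "unlikely to change" criterion in the abstract, which tolerates a small chance that the answer changes in exchange for terminating earlier.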
Journal: PVLDB
Volume: 4
Publication date: 2011